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The asymptotic properties of the spectrum of non symmetrically 
perturbed Jacobi matrix sequences 


Leonid Golinskii and Stefano Serra-Capizzano ^ 


Abstract 


Under the mild trace-norm assumptions, we show that the eigenvalues of a generic (non 
Hermitian) complex perturbation of a Jacobi matrix sequence (not necessarily real) are still 
distributed as the real-valued function 2cost on [0,7r] which characterizes the nonperturbed 
case. In this way the real interval [—2, 2] is still a cluster for the asymptotic joint spectrum 
and, moreover, [—2,2] still attracts strongly (with infinite order) the perturbed matrix se¬ 
quence. The results follow in a straightforward way from more general facts that we prove 
in an asymptotic linear algebra framework and are plainly generalized to the case of matrix¬ 
valued symbols, which arises when dealing with orthogonal polynomials with asymptotically 
periodic recurrence coefficients. 
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1 Introduction and preliminary discussion 

Consider the matrix of size n defined as 


0 1 
1 0 1 


1 


( 1 ) 


•. 1 

1 0 


The former matrix is the Toeplitz matrix T„(a) generated by a(t) = 2cost in the following 
sense: given a Lebesgue integrable function b defined on [—tt, tt) (and periodically extended on 
M), the matrix T„(6) has order n and entries (7L(6))p^ = bp-g, p,q = 1,... ,n. Here bj is the 
j-ih. Fourier coefficient of 6, i.e., 



In the specihc case o the eigenvalues are explicitly known, and they coincide with the evaluation 
of a{t) on the uniform grid j'K/{n + l) on [0, vr]. If is replaced by a more general Jacobi matrix 


ho ai 
ai bi 02 


( 2 ) 


1 

1 bn—I 
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where aj £ M tends to 1 and bj £ M tends to 0 as j —> oo, then its eigenvalues are no longer 
explicitly known, but they are again an approximation of the evaluation of a{t) on the same 
grid. This result can be obtained directly from the GLT theory (see and more precisely, 

VF £ Co(C) (Co(C) is the set of all continuous functions having a bounded support), we have 


lim — F{X) = — / F{2cost)dt = — / F{2 cos t)dt. (3) 


Here and in what follows stands for the collection of all eigenvalues of J„ counted with 
their multiplicity, the function 2cost is also called the symbol of {Jn}, and we write {Jn} ~a 
( 2 cos t, [—vr, vr]). In the orthogonal polynomials community this result is known, often under the 
unnecessary condition aj > 0, in the form 


lim — F(xj r. 

n—^oo Ti ’ 


j = l 


1 F{x)dx 
vr y_2 \/4 - ’ 


that is the weak*-convergence of the counting measures of the zeros {xj^n}'j=i of orthonormal 
polynomials {pn} to the equilibrium measure of the support of the orthogonality measure (see, 
e.g., ^1 Section 4.9], ^3 Chapter 2]). Observe that the set of zeros {xj^n}^=i is exactly the set 
considered in the left-hand side of (H- 

Let us set up the formal definitions. For any function F defined on C and any matrix An 
of size n, with the eigenvalues Xj{An), j = 1,..., n, the symbol Sa(F, An) stands for the mean 

1 " 1 

S,(F,H„) :=-J2FiXjiAn)) = - F{X). 

J—1 A£Sn 

A generic sequence of matrices {An} ■= {An}n (^n of size n) will be referred to as a matrix 
sequence. 

Definition 1.1. A matrix sequence {An} is distributed {in the sense of the eigenvalues) as a 
measurable function 9, defined on a set G C of finite and positive Lebesgue measure m{G), 
if VF £ Co(C), the following limit relation holds 

\hn J:^{F, An) = ^— [ F{9{t))dt. (4) 

n^oo m(G) Jg 

In this case we write in short {An} ~a (^jG). Moreover, two sequences {An} and {Bn} are 
equally distributed if VF £ Co(C), we have 


lim [Sa(F,F„) - Sa(F, A„)] = 0. 

n^oo 


( 5 ) 


Note that two sequences having the same distribution function are equally distributed. On 
the other hand, two equally distributed sequences do not need to have a distribution function. 
However, if one of them has a distribution function then the other necessarily shares the same 
distribution: the derivation is immediate from the definitions (for an example see |14[ Remark 
6 . 1 ]). 

Along with the distribution in the sense of eigenvalues (weak*-convergence) we will study 
another asymptotic property of the spectra called here the clustering. 
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Definition 1.2. A matrix sequence {An} is properly {or strongly) clustered at s € C (in the 
eigenvalue sense) if for any e > 0 the number of the eigenvalues of An off the disk 

Zl(s, e) := {z : \z — s\ < e} 

can be bounded by a pure constant qs possibly depending on e, but not on n. In other words 

qe{n,s) := i}^{Xj{An) : Xj ^ D(s,e)} = 0(1), n ^ oo. 

If every An has, at least definitely (that is, for all large enough n), only real eigenvalues, then s 
is real and the disk D{s, e) reduces to the interval (s — e, s + e). Furthermore, {An} is properly 
{or strongly) clustered at a nonempty closed set 5 C C (in the eigenvalue sense) if for any e > 0 

qe{n,S) := i^{Xj{An) : Xj ^ D{S,e) :=Us(^sD{s,e)} = 0{1), n^oo, ( 6 ) 

D{S, e) is the e-neighborhood of S, and if every An has, at least definitely, only real eigenvalues, 
then S has to be a nonempty closed subset of M. Finally, the term “properly (or strongly)” is 
replaced by “weakly”, if 


qs{n,s) = o{n), {qs{n, S) = o{n)), n ^ oo, 
in the case of a point s (a closed set S), respectively. 

It is clear that {An} ~a (^jG) with 9 = s a constant function is equivalent to {An} being 
weakly clnstered at s £ C (for more results and relations among the notions of equal distribution, 
eqnal localization, spectral distribution, spectral clustering etc., see [Til Section 4]). 

We will primarily be interested in the special situation, when are viewed as n x n 
principal blocks of an infinite Jacobi matrix J^o (background), Poo is a complex Jacobi matrix 
(perturbation), Aoo = Joo + Poo and An = Jn + Pn are the nxn principal blocks of A^o (so An+i 
is the one step extension of An). In fact, the main results hold in much more general setting 
when no relation between An+i and An is presnmed. 

The main conditions we impose on Poo are of two types. 

(i) . ||Pn||i = o{n) as n ^ oo, where || • ||i is the trace-norm of a matrix (i.e., the sum of its 
singular values, see |1]). This condition is equivalent to 

1 ” 

+ IPi.i+il) =0, Poo = {pj,k}'^k=i- (7) 

i=i 

The latter means the Cesaro convergence of the entries of Poo to zero. Poo is now called the 
Cesdro compact Jacobi matrix (cf. [HlIZl)- 

(ii) . ||Pn||i = 0(1) as n ^ oo, that is, 

n 

limsup V(|pjj_i| -b \pjj\ + \Pj,j+i\) < oo, (8) 

n^oo . 1 

and so Aoo is the trace class perturbation of Joo- 

We point out that the trace-norm is useful in the theoretical derivations while the conditions 
on the entries are easy to check in practice. Moreover, the equivalence of the trace-norm and 
entry-wise conditions in (i) and (ii) is well known (cf. [2l Section 2]). Nevertheless we give the 
proof in Appendix for two reasons, i.e., because we get better equivalence constants, for the sake 
of completeness, and because the proposed matrix-theoretic proof is new and elementary. 
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We proceed as follows. In Section [51 the relation between the distribution in the sense of 
eigenvalues, clustering and attracting properties of matrix sequences is discussed. Our main 
results are stated and proved in Sectional In particular, Theorems Id.41 and Id.51 allow to study 
non-Hermitian perturbations of Hermitian matrix sequences. As a straightforward consequence 
we obtain the clustering for zeros of the system of polynomials satisfying the three-term re¬ 
currence relation with complex coefficients. Finally, in Section jl] we examine the case of block 
Toeplitz and asymptotically periodic Jacobi matrices, and in Section Owe discuss further ex¬ 
tensions and generalizations. 


2 Clustering and attracting 


Let us recall the notion of the essential range which plays an important role in the study of 
asymptotic properties of the spectrum. 

Definition 2.1. Given a measurable complex-valued function 9 defined on a Lebesgue measur¬ 
able set G, the essential range of 9 is the set S{9) of points s e C such that, for every e > 0, the 
Lebesgue measure of the set 9^~^\D{s,e)) := {t € G : 9{t) G D{s,e)} is positive. The function 
9 is essentially bounded if its essential range is bounded. Finally, if 9 is real-valued, then the 
essential supremum (infimum) is defined as the supremum (infimum) of its essential range. 


S{9) is clearly a closed set (its complement is open), and moreover 
s(«)=n{B — closed set : m{9^ = ^(G)}, 

where m(X) is the Lebesgue measure of a set X. 

In the case of a bounded in the operator norm sequence {An}, a further mathematical 
instrument that we need is a way for relating formula (@J), with F a generic polynomial, to the 
same formula in its full extent, i.e., with F being a continuous function. The answer is partly 
contained in the Mergelyan Theorem and not completely positive. We need assumptions on 
the essential range of the symbol 9 and a priori assumptions on the clustering properties of the 
sequence {An}. The reason is in part due to the barrier given by the Mergelyan Theorem stating 
that the closure in the uniform norm of the polynomials on a compact set S is given by the 
set of all continuous functions on S which are holomorphic in its interior, provided that C\S is 
connected (for the proof see ^1 Theorem 20.5, pp. 423-427]). Therefore the polynomial space 
is able to approximate every continuous function on S if and only if S has the empty interior 
and C\5' is connected. 

Theorem 2.2. Assume that a matrix sequence {An} is weakly clustered at a compact set S <Z C 
with the connected complement, and the spectra are uniformly bounded, i.e., |A| < G, X G 
for all n. Assume further that 0 holds with F a generie polynomial of an arbitrary fixed degree, 
and the essential range of 9 in eontained in S. Then relation 0 is true for every eontinuous 
funetion F with a bounded support which is holomorphic in the interior of S. Moreover, if the 
interior of S is empty, then {An} is distributed as 9 on its domain G. 


Proof. In the argument we follow Tilli (see cni, the proof of Theorem 3). Take F continuous 
over S and holomorphic in its interior. By the Mergelyan Theorem, for every e > 0, we can find 
a polynomial p such that \p{z) — F{z)\ < e for every z G S. Since the essential range of 9 is 
contained in S, it is clear that \p{9{t)) — F{9{t))\ < £ a.e. in its domain G. Therefore 


1 

m{G) 



F{9{t))dt 


m{G) 


'G 


p{9{t))dt 


< 


m{G) 


dt = £. 


(9) 
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Next, we go over to the left-hand side of ©• By the definition of clustering for any fixed e' > 0 
we have 

#{A e |A - ^1 > e', Vz G 5} = #{A G A ^ D{S,£')} = o{n). 

Moreover, by the hypothesis of the uniform boundedness of |A| < C for every A G with 
a pure constant C independent of n. Therefore, by extending F outside S in such a way that it 
is continuous with a bounded support, we have 


E 


F(\) 


AeS„, \^D{S,e') 

AeS„, \^D{S,e') 


<—#{AgS„, X^DiS,e')} = oil), 
n 


<—#{AgS„, \iD{S,e')] = o{l), 
n 


with M = max(||T||oo, ||p||oo); and the infinity norms are taken over {z G C, \z\ < C}. Conse¬ 
quently, by setting A = \T,x{F — p, An)\ we have 


A = 


1 


n 


Y,{fW-pW) 


Aes„ 


ir(A)-p(A)| 


AGSt] 


< - 


|F(A)-p(A)| + - |F(A)-p(A)| 

Aes„, AeD(5,e') AgS„, \^D(S,e') 

^ |F(A)-p(A)|+»(l) 

Aes„, XeD{S,e') 


- Y1 \fW-pW\ + - E 

AGSn, A€5 AGSn, \^D{S,e')\S 


|F(A)-p(A)|+o(l). 


For A G S' we use |T(A) — p(A)| < e, and for A G D{S,£')\S we write 

|T(A) - p(A)| < |F(A) - F(A0| + |T(A0 - p(A0| + |p(A') - p(A)|, |A - A'| < e', A' G 5, 


so that |F'(A) — p(A)| < ci(e') -t- e -1- C 2 (e,e') = 0{£,£') with 


lim lim 0(e, s') = 0. 
e^Oe'^O 

(10) 

Hence 


A < e -|- O^s^s') F o(l)- 

(11) 

Moreover, from the hypothesis of the theorem we have 


hmSA(p,^n)= [ p{0{t))dt. 

n^oo m[G) Jg 

(12) 


Since e and e' are arbitrary, it is clear that relations ©-dm) imply © to hold for F as well. 
Finally, when S has empty interior, we have no restriction on F except for being continuous 
with a bounded support, and therefore what we have proved is equivalent to {An} ~a {e,G). □ 

To proceed further, we need a notion which is essential in the orthogonal polynomials 
theory. 
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Definition 2.3. A matrix sequence {A^} is strongly attracted s £ C if 


lim dist(s, = 0, 


( 13 ) 


where dist(A, y) is the usual Euclidean distance between two subsets X and Y of the complex 


plane. Furthermore, let us order the eigenvalues according to its distance from s, i.e., 

|Al(A„) — s| < \\2{An) — s| < • • • < \Xn{An) — s|. 

We say that the attraction is of order r(s) £ N, r(s) > 1 is a fixed number, if 
lim |Ar( 5 )(A„) - s| = 0, liminf - s| > 0. 

The attraction is of order r(s) = oo if 

lim \Xj{An) — s| = 0 

for every fixed j. Finally, the term “strong or strongly” is replaced by “weak or weakly” if lim 
is replaced by lim inf in m- 

It is not hard to ascertain, that if {An} is at least weakly clustered at a point s, then s 
strongly attracts {An} with infinite order. Indeed, s is an attracting point of finite order implies 


#{A£S„: X^D{s,6)} 


lim 


n 


for some <5 > 0, that is impossible in the case when {An} is weakly clustered at s. On the other 
hand, there are sequences which are strongly attracted by s with infinite order but not even 
weakly clustered at s. 

The notions previously introduced in this section are intimately related, as emphasized in 
the following theorem. 

Theorem 2.4. Let 6 he a measurable function defined on G with finite and positive Lehesgue 
measure, and S = S{9) the essential range of 9. Let {An} be a matrix sequence distributed as 9 
in the sense of eigenvalues. Then 

a) S{9) is a weak cluster for {An}; 

b) each point s £ S{9) strongly attracts F„ with infinite order r(s) = oo. 

Proof, a). Given e > 0, we apply with the test function of the form 



1, for z £ D{S,s/2) n D{0,l/e), 

0, ioi z £C\{D{S,s)nD{0,2/e)), 


0 < Fe < 1. 


It is clear that 


An) < 


#{X £ Yn : X £ {D{S,e) n D{0,2/e))} 


n 


#{X£Yn:X£DiS,e)} qfin, S) 


n 


n 


qe{n, S) is defined in Q, and so 


liminf Sa(F£, A„) < 1 — lim sup ^ ^ 
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By the assumption there exists 


lim J:x{Fe,An) = 


n^oo 


m{G) 


Ig 


,, > .«{0<-‘)(P(5,./2)nD(0,l/.))) ^ 

m{G) 


We have 


{D{S, e/2) n D{0, 1/e)) = {D{S, e/2)) n 6^-^^ {D{0, 1/e)) = B, n A, 

and hence 

1 V qs{n,S) miT^nAe) 

1 —hmsup->-—7-. (14) 

n^oo n m{G) 

By the definition of the essential range the right-hand side in (d tends to 1 as e —> 0, and so 

lim„^oo n~^qs{n, S) = 0, as needed. 

b). Let s G S and e > 0. Construct L/ by 

|0, for z G C\(Zl(s, 2e)), 

Since is dominated by the characteristic function of D(s, 2e), we see that 

^{A G : A G D(s, 2e)} 


n 


^ '^\{Fe, An). 


But {An} ~A {9, G), and so by employing F^ as the test function we have 

m{9^~^'>{D{s, e)) 


lim T.x{Fe,An) = 


n—>co 


m{G) 


'G 


Fs{9{t))dt> 


m 


(G) 


since L/ dominates the characteristic function of D{s, e). By the definition of the essential range 
the right-hand side is strictly positive and hence 


lim inf : A G L>(s,2£)} 


> 0 . 


n 


The latter means exactly that s attracts with order r{s) = oo, as was to be proved. □ 

The final result of this Section demonstrates the stability of the clustering under certain 
perturbations (cf. m Corollary 4.1]). 

Proposition 2.5. Let {Xn} and {W} be two Hermitian matrix sequences, at least definitely, 
M a closed subset of the real line, and assume that \\Xn — Will = o(n) {\\Xn — W||i = 0(1)). 
Then {Xn} is weakly {strongly) clustered at M if and only if so is {W}- 


Proof. Let Xj{Xn), XjiYn) be the eigenvalues of Xn and Yn, respectively, labelled in the decreas¬ 
ing order. For an arbitrary e > 0 we introduce three sets of indices 

I{Xn,e) = {j = l,2,...,n:dist{Xj{Xn),M)>e}, 

I{Yn,e) = {j = 1,2,... ,n : dist(Aj(W),AL) > e}, 

I{Xn,Yn,e) = {j = 1,2,... ,n : \Xj{Xn) - Xj{Yn)\ > e}. 

Denote by |/(A„,e)|, |/(W,e)| and \I{Xn,Yn,e)\ their cardinalities. It is clear that 

I{Xn, e) Cl [Xn, Yn, |) (J ^ |) , 
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and so 


\i{Xn,e)\ < |/(x,,yn,|)| + |/(i;x,|) 


According to the Lidskii-Mirsky-Wielandt Theorem (see jSl Theorem IV.3.4 and Example 
IV.3.5]) 

n 

Y, |A,(V„) - A,(V„)| < ||A„ - y„||i, 


i=i 


so that 


Hence 


e |/(x„,y„,e)| < ^ |A,-(A„)-A,-(y„)| < ||x„-y„||i. 

ie/(x„,y„,£) 


\I{Xn,e)\ < ^||A„-y„||i + |/(y„,|) 


The rest is plain. 


□ 


3 Non Hermitian perturbations of Hermitian matrix sequences 


First we recall the definition of real and imaginary parts of a matrix. Given a square matrix A, 
we define Re(A) and Im(A) as {A + A*)/2 and {A — A*)/{2i), respectively, where X* denotes the 
conjugate transpose of the matrix X. In this way, in analogy to the complex field, we naturally 
have A = Re(A) + f Im(A). 

The result below is the well-known Ky Fan-Mirski Theorem (see, e.g., [31 Proposition 
III.5.3]). 

Theorem 3.1. Let A be a square matrix of size n, and Xj{A), Aj(Im(A)) the eigenvalues of 
A and Im(A), respectively, labelled in the decreasing order, so that Im(Ai(A)) > Im(A 2 (A)) > 

• • • > Im(An(A)) and Ai(Im(A)) > A 2 (Im(A)) > • • • > An(Im(A)). Then 

<? <? 

^Im(Aj(A)) < ^ Aj(Im(A)), q = l,...,n, (15) 

i=i i=i 

and the equality prevails for q = n. 

Equivalently, let Aj(A) and Aj(Re(A)), j = 1,... ,n, be the eigenvalues of A and Re(A), respec¬ 
tively, labelled in the decreasing order, so that Re(Ai(A)) > Re(A 2 (A)) > ••• > Re(An(A)) and 
Ai(Re(A)) > A 2 (Re(A)) > • • • > An(Re(A)). Then 

<? <? 

^Re(Aj(A)) < ^ Aj(Re(A)), q = l,...,n, (16) 

i=i i=i 

and the equality prevails for q = n. 


The next statement provides a simple bound for the number of nonreal eigenvalues of a 
matrix A. In what follows F(V) always stands for the set of all eigenvalues of a matrix X: 
S(V) = {A,(V)}-=,. 








Lemma 3.2. Let A = Re(^) + ilm(^). Then for an arbitrary e > 0 


#{AGS(^):|Im(A)|>e}< 


l|Im(AL)||i 

e 


(17) 


Moreover, if for some real c, d we have c < Aj(Re(^)) < d for all j, 
and 


#{A G S(Al) : A ^ D{[c,d],e)} < 


then c < Re(Aj(^)) < d 
(18) 


Proof. Denote by 


m+ := 

AeS(Im(yl)), A>o 



E i^i] 

AeS{Im{A)), A<0 / 


the positive (negative) mass of the eigenvalues of Im(^). Since Im(A) is Hermitian, its trace- 
norm equals the sum of the absolute values of its eigenvalues, so ||Im(74)||i = m'^ + m~. We 
apply the first part of Theorem 13.II for A and —A to obtain 


r'^ := Im(A) < m"*", r := |Im(A)| < m . (19) 

AgS(A), Im(A)>0 AgS(A), Im(A)<0 

Therefore, if we take an arbitrary e > 0, the number of the eigenvalues of A whose imaginary 
part is bigger that e has to be bounded by ||Im(^)||i/e. Indeed, 


||Im(74)||i = m’*'-|-m > r"*"-|-r = ^ |Im(A)| > ^ |I™(^)I 

AeE(A) AeS(A), |Im(A)|>e 

> ^ e = e • #{A G E(A), |Im(A)| > e}, 

AeS(A), |Im(A)|>£ 


as needed. 


Next, let A be an eigenvalue of A corresponding to an eigenvector x. Then 

^ _ xMx _ x*Re{A)x .x*lm{A)x 

x*x x*x x*x 

which implies that Re(A) G [c,d], since, by the assumption, every eigenvalue of Re(A) belongs 
to [c, d\. So m follows from CZI). □ 

Corollary 3.3. Let {An} be a matrix sequence such that ||Im(^„)||i = o(re) as n ^ oo. Then 
qc{n,M.) = o(n), so {An} is weakly clustered at M. Moreover, if all the eigenvalues ofRe{An) are 
in [c, d], then all the eigenvalues of An have real parts in the same interval and qe{n, [c, d]) = o(n). 
The same result holds if o{n) is replaced by 0(1) and “weakly clustered” by “strongly clustered”. 


The following result establishes a link between distributions of the Hermitian sequence 
{Re(H„)} and the sequence {An}. As a matter of fact, we will prove a more general statement 
concerning non-Hermitian perturbations of Hermitian matrix sequences. As usual, ||Ai|| stands 
for the operator (spectral) norm of a matrix X. 

Theorem 3.4. Let {Bn} and {Cn} be two matrix sequences. Bn is Hermitian, at least defi¬ 
nitely, and An = Bn + Cn. Assume further that {Bn} is distributed as {9,G), G of finite and 
positive Lebesgue measure, both ||H„|| and HOnll are uniformly bounded by a positive constant C 
independent of n, and ||0„||i = o{n), n —> oo. Then 0 is real valued and {An} is distributed 
as {d,G) in the sense of the eigenvalues. In particular, if S{0) is the essential range of 9, then 
{An} is weakly clustered at S{9), and S{9) strongly attracts the spectra of {An} with infinite 
order of attraction for any of its points. 
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Proof. Denote by trX the trace of a matrix X, that is, the sum of its diagonal entries (or the 
sum of its eigenvalues) 

n 

iTX= ^ A = ^(X)fc,fc, 

AgS(X) k=l 

SO ixAn — irBn = trC^. As |trA| < ||A||i, the assumption on the trace-norm of Cn yields 

- V A = - V A + o(l). 

Aes(u„) Aes(s„) 


The latter is closely related to (@J) with F{z) = z (defined over the whole C). Since {Bn} is 
distributed as 6 over G, we infer by 


lim — 

n—^oo fl 


E X = lim — 

n^oo fl 

AgS(A.,) 


E ^ 

ags(b„) 


ljm)dt, F(.) = z, 


( 20 ) 


where we are allowed to take F(z) = z (which has an unbounded support), since by the premises 
of the Theorem ||A„|| < 2C for all n, and so the spectra of {An}, {Bn}, and {Cn} are all 
contained in the closed disk {|z| < 2(7}. Relation (01|) can be viewed as the first step from a 
distribution relation for the Hermitian sequence {Bn} to the same distribution relation for the 
sequence {An}. The next step is to extend (1201) to the case when F is an arbitrary polynomial 
of a fixed degree. By the linearity it suffices to consider only monomials. Clearly, for any fixed 
nonnegative integer q, the matrix An can be written as An = Bn+Rn,q and, thanks to the Holder 
type inequalities for the Schatten p norms ||Ay||i < ||A|| • ||T||i (see jSl Corollary IV.2.6]) we 
have ||R„^g||i = o{n) as n ^ oo. Therefore, by repeating the same reasoning as above we deduce 


hm — A^ = hm — 

n—^■oo fl n—>oo fi 

A€E(An) A€E(Sn) 




F{e{t))dt, 


F{z) = zT 


( 21 ) 


To go over in m from polynomials to arbitrary continuous functions with bounded support 
we would like to invoke Theorem o So let us make sure that the rest of its hypothesis is 
satisfied. As we have already mentioned, ||A„|| < 2C for all n. Next, it is clear that 

||Re(C0||i < liaili = o(n), ||Im(C0||i < ||a||i = o{n) (22) 

as n ^ oo. Write An = Bn + Re(C„) + ilm((7„). By Theorem 12.41 {B..,\ is weakly clustered at 
S{9), and so is {Re(A„) = Bn+Be{Cn)} by ProDosition l2.5[ Note that S{9) is now a compact set 
which lies in the interval [—2(7, 2(7], and all the eigenvalues of Re(A„) are in the same interval. 
Corollary ESI now claims that {An} is weakly clustered at [—2(7, 2(7] D S{9), and the application 
of Theorem 12.21 completes the proof. □ 

The following theorem deals with the case of the strong clustering. 

Theorem 3.5. Let {Bn} and {Cn} be two matrix sequenees, Bn is Hermitian, at least definitely, 
and An = Bn + Cn. Assume that {Bn} is strongly clustered at [c,d], ||(7n||i = 0(1), n — > oo and 
||A„|| is uniformly hounded by a positive constant C independent of n. Then {An} is strongly 
clustered at [c, d]. 


Proof. Since now ||Re((7,i)||i = (7(1) and ||Im(C„)||i = 0(1), both the related sequences are 
strongly clustered at zero by Proposition 12.5L A repeated application of the same proposition 
shows that {Bn + Re((7,i)} is strongly clustered at [c,d]. Although we don’t have the right 
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to invoke Corollary Kli-il at this point, since the eigenvalues of Re(A„) = + Re(C'„) are not 

necessarily in [c, d], we can follow a direct approach stemming from Theorem lihll 

Since ||^n|| < C, the real part of any eigenvalue of An belongs to [—C,C] and the same 
is true for any eigenvalue of Re(^ri)- For e > 0, let be the number of eigenvalues of An 

whose real parts are below c — e, and analogously, let (e) be the number of eigenvalues of Xn 
whose real parts exceed d + e. We want to prove that both and (e) can be bounded by 

a constant possibly depending on e, but independent of n. By m we have 

qi (e) git (e) 

^ Re(A,(^0) < Aj(Re(Al„)) 
i=i i=i 


with 


Re(Ai(A„)) > Re(A 2 (^n)) > • • • > F,e(Ag+^^^(^„)) > d + e > Re(Ag+^^^^^(A„)). 


Therefore 


gitO 

(d + e)g+(e) < Xj{Re{An)). 

j=i 


(23) 


Thanks to the strong clustering of Re(^„) = i?„ + Re(C',i), for every e' > 0 there exists a positive 
constant K{e') independent of n such that the number of eigenvalues of Re(A„) not belonging 
to (c — e',d + e') is bounded by K{e'). Consequently, we infer 


gitC) 

^ Xj{Re{An)) < CK{e') + {d + e')iqUe)-KiE')y 




(24) 


with (x)"*" = (x + |x|)/2. Putting together (1^ and (1^ . by choosing e' = e/ 2 , we finally deduce 


Qni^) < 


2CK{e/2) 

E 


where, as requested, the right-hand side is independent of n. A similar reasoning on —Xn gives 
the same bound on qn{£), as claimed. 

As for the imaginary parts of the eigenvalues of An, we can apply directly (EH). The proof 
is complete. □ 

The latter result can be extended to the case of clustering at several intervals, the situation 
we will encounter later in Theorem 


Theorem 3.6. Let {Bn} and {Cn} he two matrix sequences, Bn is Hermitian, at least definitely, 
and An = Bn+Cn- Let E he a union ofm disjoint closed intervals [possibly, degenerate). Assume 
that {Bn} is strongly clustered at E, ||C,i||i = 0(1), n ^ oo and ||A„|| is uniformly bounded by 
a positive constant C independent of n. Then {An} is strongly clustered at E. 


Proof. We reduce this statement to the previous one. Denote 

E — Uj—fioj, bfi, ^ b-^ 02 ^ 62 ^ . . . ^ Oni ^ bni, 

and put T[z) = YlJ[^i[z — aj){z — bj). Obviously, T[E) G [^,0], oj = min 3 ;T(x) < 0, and 
T[x) > 0 for X G M\E. By the Spectral Mapping Theorem (see e.g. [Up. 5]) i? is a strong 
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cluster for {B^} yields [w,0] is a strong cluster for {T(i?„)}. Next, by the hypothesis of the 
theorem and the Holder type inequalities for the trace norm 


T{An) = T{Bn) + Rn, l|-Rn||l = 0(1), U ^ OO. 


We have the right to apply Theorem Id .51 to the matrix sequences {T{An)}, {T{Bn)} to conclude 
that {T{An)} is strongly clustered at [w,0]. The repeated application of the Spectral Mapping 
Theorem completes the proof. □ 


Let us go back to the Jacobi matrix sequences described in the introduction. Let 


A 


OO 


6o Cl 
ai bi C2 

«2 ^2 C3 


be an infinite complex non symmetric Jacobi matrix with the bounded entires 

sup (lOnl + |6n| + |Cn|) < O < 00 . (25) 

n 

As a simple consequence of Theorem EM we can prove the following 
Corollary 3.7. Let A^o be the Cesaro compact perturbation of J^, that is, 

1 

lim — y^(|l - a,| + \bj\ + |1 - cA) = 0, (26) 

n^oo Ti ^ 

and {An} its principal n x n blocks. Then {An} is distributed as (2cost, [—vr,vr]) in the sense 
of eigenvalues, weakly clustered at [—2,2], and [—2,2] strongly attracts the spectra of {An} with 
infinite order of attraction for any of its points. 

Proof. We apply Theorem 13.41 with Bn = C„ = An — Jn- {Bn} is clearly distributed as 
(2cost, [—TTjVr]). Inequality (12511 provides the uniform boundedness of ||A„|| and [[Cnll. Finally, 
(uni) is equivalent to HCnlli = o(n), and the result follows. □ 

Corollary 3.8. Let A^o he trace class perturbation of Joo, that is, 

n 

lim sup (11 — af + \bj\ + |1 — Cj\) < oo. (27) 

n—^oo . . 

Then {An} is distributed as (2cost, [—vr, vr]) in the sense of eigenvalues, strongly clustered at 
[—2,2], and [—2,2] strongly attracts the spectra of {An} with infinite order of attraction for any 
of its points. 

Proof. The only point to be proved is that the weak cluster is also strong, and this is implied 
by Theorem 13.51 □ 

If we are concerned only about the clustering of the spectrum, the more elementary Corol- 
larv 13. di does the job. In this case the assumption Im(Aoo) being a Cesaro compact perturbation 
of that is, 

1 ” 

n 
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already guarantees that {A^} is weakly clustered at M. Moreover, if all the eigenvalues of Re(A„) 
are in [c,d], then {An} is weakly clustered at [c,d]. 


It is worth pointing out that T,(An) now agrees with the set of all zeros of the polynomial 
Pn which satisfies the three-term recurrence relation 


zpj{z) = ajPj-i{z) + bjPj{z) -h Cj+ipj+i{z), j E Z+ 


(28) 


P-i = 0, Po = 1. Such polynomials are studied systematically in the theory of Fade approxima¬ 
tions and continued J-fractions. More precisely, pn is the denominator of the nth diagonal Fade 
approximant and its zeros are the poles of this Fade approximant. In turn, the closed interval 
[—2, 2] is now the essential spectrum of the bounded operator ^oo in 

Remark. In PEI the authors studied the attracting properties of the spectrum of ^oo in 
the case when ^oo is a compact perturbation of Jqo- The celebrated theorem of H. Weyl claims 
that S(Aoo) = [—2,2] U T,d{Aoo), where the discrete spectrum Sd(74oo) is at most denumerable 
set of eigenvalues Xj{Aoo) of the finite algebraic multiplicity Vj, off the essential spectrum [—2,2], 
It is proved in PE] that each Xj is the attracting point of T,{An) of order Uj. Our result in 
Corollary 18.71 supplements this one nicely. Note that in the case (uni the Weyl theorem is only 
partly true (see [3 Theorems 7 and 9]): still [—2, 2] C S(^oo), but in general there is no discrete 
part of the spectrum any more. 

4 Asymptotically periodic Jacobi matrices: the block case 

We start out with the definition of the spectral distribution with matrix-valued symbols. 
Throughout the rest of the paper 0 will stand for a k x k matrix-valued and the Lebesgue 
integrable function (i.e., all its entries are integrable) with the eigenvalues Xj{9), j = 1,2, ... ,k. 

Definition 4.1. Let 9 he a kxk matrix-valued the Lebesgue integrable function defined on a set 
G of finite Lebesgue measure. A matrix sequence {An} has the asymptotic spectral distribution 
9 if for all F ^ Cq one has 



As in the scalar case, we write in short {An} ~a {9,G). 


Under the essential range of 9 we mean now the set 


k 


S{9) := |jRange(A,(0)). 


The same argument as applied above in the proof of Theorem 12.41 leads to the following result. 


Theorem 4.2. Let 9 he a kxk matrix-valued, the Lebesgue integrable function defined on a set 
G of finite Lebesgue measure and S = S{9) he the essential range of 9. Let {An} be a matrix 
sequence distributed as 9 in the sense of eigenvalues. Then 


a) S{9) is a weak cluster for {An}; 

b) each point s E 5(0) strongly attracts with infinite order r{s) = oo. 
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Definition 4.3. Let 6 be a A: x /c matrix-valued and the Lebesgue integrable function defined 
on [—vr, vr) with the Fourier coefficients 

bj = f b{t) exp{—ijt) dt E j E Z. (29) 

J-n 

The function b is called the generating function of the sequence of block Toeplitz matrices 


Tn{b) 


bo b-i ■ ■ ■ bi-n 
bi bo ': 

J ■ • • • b.i 

bn-i • • • bi bo 


0 (^knxkn 


It is easy to observe that T„(6) is Hermitian for every n if and only if its generating function 
b is Hermitian for almost every t E [—vr, tt), and the index n here denotes the block order. 

Let us define a matrix sequence {Tm(6)} by the following recipe: T^n '■= Tn-, and T^n-j is 
obtained from by deleting the last j rows and columns for j = 1, 2,..., A: — 1. In other words, 
Tm is the principal m x m block of the infinite block-matrix Too{b) = {bp-q}^^^Q. 

The following general result due to Tilli (see m) is very important in our context. 


Theorem 4.4. Ifb is any Hermitian-valued and absolutely integrable function on [—7r,7r] 



dt < -|-oo, 


where || • || is any matrix norm in then {Tm{b)} ~a tbe sense of Defini¬ 

tion o 


Now we turn to the case of asymptotically periodic Jacobi matrices. Let 



,( 0 ) 

( 0 ) 


,( 0 ) 

:(o) 

JO) 


,( 0 ) 

JO) 



,( 0 ) 


> 0 , 


E M 


(30) 


be an infinite Jacobi matrix with A:-periodic entries 


a® -a® 


Cil = €>, n e 


(31) 


and a = (oq^^ ..., b = (6g'^'’, ..., 6 )e(^) be two real vectors of order k, which 

define completely the entries of the whole matrix In that case the principal mx m block 


,(o) 


do) ,( 0 ) 


(0) 


of in (1301) is denoted more explicitly by Jm^ = Jm [a, b]. 

Let 0(a, b,t) be the Hermitian matrix-valued trigonometric polynomial of the form 


( 0 ) _ 


e{a,h,t) = Jfc[a,b] -h 


0 

0 


0 


exp(—ii) 
14 


0 

0 


exp (it) 
0 

0 


(32) 








It is a matter of simple computation to verify that 9 has only three nonzero Fourier coefficients 
9o and 9±i, 


Tn{e{a,h)) 


9o 0-1 O 

01 00 

••• Li 

O 01 00 


Jkn t)]) 


and in general Tm(0(a, b)) = Jm[a, b] for all m. So the asymptotic distribution for the periodic 
Jacobi matrix sequence is a particular case of Theorem 14.41 with h = 0(a, b). Such asymptotic 
distribution, paraphrased as the asymptotic distribution of the zeros of orthogonal polynomials 
Pn with periodic recurrence coefficients, is well known (see, e.g., [101 Section 3] and references 
therein). The essential range 5'(0(a, b)) is tightly related to the support of the corresponding 
orthogonality measure (cf. |11[ Theorem 13]). If /c = 1, then 0(a,b) = 6® + cost and 
putting = 0 and = 1 we come to the Toeplitz matrix O- 

As in the scalar case {k = 1), we are interested in generic complex perturbations of . 
An infinite complex Jacobi matrix 


To. 


00 Cl 
Oi 01 C2 

02 02 C3 


On 1 0n)Cn £ C 


(33) 


is called the Cesdro asymptotically k-periodic if 

i=i 

the asymptotically k-periodic if 

lim (|an - 0^1 + |0n - 0i°^| + |cn - o^°^|) = 0, 
n—>oo 

and the trace class asymptotically k-periodic if 

n 

limsup^doj - + \hj - + \cj - af^\) < oo, 

n—>co . 1 

1=1 

for some /c-periodic sequences {oi°\0i°^} as in m- In other words, Jqo = + Poo with the 

/c-periodic (jnni) (called the background) and the Cesaro compact (compact, the trace class) 
perturbation Poo- 

The following results can be proved in exactly the same fashion as Corollary 13.71 and Corol¬ 
lary |THJ In the latter case Theorem El comes into play. The point is that the essential range 
S'(0(a, b)) is now a union of at most k disjoint closed intervals, and all the eigenvalues of Jn^ 
(the zeros of orthogonal polynomials pn'^ but finitely many (at most 2k), lie in S{6{a, b)). 

So, in particular, the matrix sequence {Tn°^} is strongly clustered at S'(0(a, b)). 

Theorem 4.5. Let Joe be the Cesdro asymptotically k-periodic Jacobi matrix with the back¬ 
ground J^^ and 0(a, b) (I.V^I) the generating function for J®. Then {Jn} is distributed as 
(0(a, b), [—TT, vrj) in the sense of eigenvalues, weakly clustered at S'(0(a, b)), and S{0{a,h)) 
strongly attracts the spectra of {Jn} with infinite order of attraction for any of its points. 


15 






Theorem 4.6. Let J^o he the trace class asymptotically k-periodic Jacobi matrix with the back¬ 
ground and 0(a, b) (1,9^1) the generating function for . Then {Jn} is strongly clustered at 
^(^(a, b)), and 5(0(a, b)) strongly attracts the spectra of {Jn} with infinite order of attraction 
for any of its points. 


5 Concluding remarks and further generalizations 

As a conclusion, we observe that tools from matrix theory El El combined with those from 
asymptotic linear algebra EHHEHn] have been crucial for proving plainly results concerning 
non Hermitian perturbation of Jacobi matrix sequences. A special part of them is the GLT 
theory (see unmsi and references therein) which allows to treat the case of variable coefficients 
under very mild restrictions on the regularity of the coefficients (e.g. numerical approximations 
of variable coefficient PDEs m and systems of PDEs cni, Jacobi sequences with asymptotically 
varying periodic El and non-periodic m coefficients, etc.). The interesting fact is that the tools 
explicitly developed here are applicable verbatim to these cases as well, by allowing to deal with 
non Hermitian perturbations under the same mild trace conditions. 


6 Appendix. Equivalence of trace-norm and entry-wise condi¬ 
tions 


Let A = {aj^k}^j^=i be a complex matrix of size n, let || • ||i be the trace-norm, and let || • ||[i] be 
the componentwise l^ norm: 


j=l j,k=l 

with (Ti > (T 2 > • • • (Tn > 0 being the singular values of A. With the notations (i) and (ii) 
at the end of Section ^ we would like to prove that HPnlli = o{n) if and only if © holds 
and ||Pn||i = 0(1) if and only if 0 is satisfied. Taking into account the definition of the 
norm || • ||p] and the tridiagonal structure of P„, n > 1, condition 0 can be rewritten as 
ll-fn||[i] = o(n) and, similarly, (|HI) is equivalent to ||Pn||[i] = 0(1). Therefore what we would 
like to prove is the asymptotic equivalence, independently of the size n, of the two norms || • ||i 
and II • ||[i]. Specifically, we look for two positive constants c and C independent of n such that 
c||A||i < ||A||p] <0||A||i for every complex matrix A of size n. For a fixed n, the existence of the 
two positive constants c = c(n) and C = 0(n) is trivial thanks to the topological equivalence of 
norms in any finite dimensional vector space. The nontrivial part is to show that c and C can be 
chosen independently of n. Unfortunately, the latter is in general false as the following example 
shows. Take A = ^=1 with aj^k = I, Vj, A: = 1,..., n. Then ai = n, a 2 = ■ ■ ■ = CTn = 0, 

and therefore ||A||i = n while ||A||p] = so that C{n) > n (indeed it can be proved that the 
previous example is an extremal one and indeed the best constant C is exactly C{n) = n). 

Therefore the equivalence of the trace-norm and of the l^ entry-wise norm has to exploit the 
fact that the involved matrices are tridiagonal. In the subsequent steps we will use the Fourier 
analysis of matrices introduced by Bhatia in |^. Let A be a generic tridiagonal matrix of size 
n and, for any m = 1 — n,..., n — 1, let Um(A) be the matrix which coincides with the m-th 
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diagonal of A, i.e., ^ = aj^k 'A j — k = m and {'Dm{A)}j = 0 otherwise. Therefore 


1 

A= ^ V^{A) 

m=—l 


(34) 


and, by the structure of any DmiA), a plain check shows that 

\\V^{A)\\^i] = \\Vm{A)h. (35) 

Consequently, by the definition of || • ll[i], (El), and ESI) we have 

1 1 

Mil, = II V^{A)h < E IPm(-4)lli 

m=—1 m=—l 

= ^ I|I5„M)|||1, = ||/1|||1, 

m =—1 

and so c = 1 which is independent of n. For the reverse inequality we have 

1 1 

ll^ll[l] = II (Pm(^)ll[l] = l|(^m(^)||[l] 

m=—1 m=—1 

1 

= p^(A)||i<3P||i, 

m=—1 

where for the last inequality we use the identity (see [1]) 

1 r 

Vm{A) = — / D{t)AD*{t)ex.p{—imt)dt, 

2 vr 7-,r 

with D{t) a diagonal unitary matrix whose j-th diagonal entry equals exp(i(j — l)t). From the 
latter identity, since the trace-norm is a unitarily invariant norm (see [3]), it easily follows that 
\\^m{A)h < ll^lli- We conclude that C = 3 which is again a constant independent of n, as 
desired. 

As already pointed out in the introduction only the proof is new. The result can be recovered 
directly from known facts: for instance use inequalities (2.32) in jH] with p = 1 and the (trivial) 
equivalence between l°° and norms for vectors of size 3. Then one arrives to 

^I|A||i<||.4||[i|<9||A||i 

for every tridiagonal matrix A. Note however that our constants c = 1 and C = 3 are tighter 
and indeed c = 1 is optimal (take A the identity matrix). 

Finally, it should be remarked that the similar equivalence results can be obtained for 
more general patterns. Instead of tridiagonal structures we could equally well have considered 
banded structures (also in a multilevel sense, see [H]). In that case, the proofs are identical and 
the constants are c = 1 and C equals to the number of nonzero diagonals of the considered 
band matrices. As long as this number is independent of n, the two norms || • ||[i] and || • ||i 
are asymptotically equivalent, i.e., with equivalence constants positive and independent of the 
size n. 
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